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ABSTRACT 

Photo-z errors, especially catastrophic errors, are a major uncertainty for precision 
weak lensing cosmology. We find that the shear- (galaxy number) density and density- 
density cross correlation measurements between photo-z bins, available from the same 
lensing surveys, contain valuable information for self-calibration of the scattering prob- 
abilities between the true-z and photo-z bins. The self-calibration technique we propose 
does not rely on cosmological priors nor parameterization of the photo-z probability 
distribution function, and preserves all of the cosmological information available from 
shear-shear measurement. We estimate the calibration accuracy through the Fisher 
matrix formalism. We find that, for advanced lensing surveys such as the planned 
stage IV surveys, the rate of photo-z outliers can be determined with statistical uncer- 
tainties of 0.01-1% for z < 2 galaxies. Among the several sources of calibration error 
that we identify and investigate, the galaxy distribution bias is likely the most dom- 
inant systematic error, whereby photo-z outliers have different redshift distributions 
and/or bias than non-outliers from the same bin. This bias affects all photo-z calibra- 
tion techniques based on correlation measurements. Galaxy bias variations of O(0. 1) 
produce biases in photo-z outlier rates similar to the statistical errors of our method, 
so this galaxy distribution bias may bias the reconstructed scatters at several-c level, 
but is unlikely to completely invalidate the self-calibration technique. 

Key words: (cosmology:) large-scale structure of Universe: gravitational lensing: 
theory: observations 



1 INTRODUCTION 

Weak gravitational lensing 
most powerful pro bes of 



is emerging as one of the 
dark matter, dark energy 
|Albrecht et al. 2006T) and the nat ure of gravity at cosmo- 
logical scales ( Jain fc Zhang 2008 ). In less than a decade 
after first detections jBacon et allhoOOl; iKaiser et alj [2000 ; 
IVan Waerbeke et al.l [20001 : IWittman et alJl200Ch . the lens- 
ing measurement accuracy a nd dynamical r ange have been 
improved dramatically (e.g. iFu et all [2008). Future weak 
lensing surveys have the potential to measure the lensing 
power spectrum with sub-1% statistical accuracy for many 
multipole £ bins. However, whether we can fully utilize this 
astonishing capability is up to the control over various sys- 
tematic errors. They could arise from uncertainties in the- 
oretical modeling, including the non-l i near e volution of the 

]) and the influ- 
200i : |jing et ail 



includes the galaxy intrinsic alignment jHirata et alj 2004 ■ 
Mandelbaum et al.ll2006l: iHirata et al.ll2007l : IOkumura et all 
20091: lOkumura fc Jin j|2009h . influence of the telescope PSF 
|Hevmans et al.ll2006l : lMassev et al.ll2007[). photometric red- 
shift (photo-z) calibration errors "(|Ma. Hu. fc Huterer|[200rj : 
iBernstein fc Hutererl [20091 ) ■ etc. Precision lensing cosmol- 
ogy puts stringent r equirements on calibrating these errors 



ogy puts stringent r t 
ijHuterer et al.l l2006) 



universe jHeitmann et all 120051. 120081. 1200' 
ence o f baryons (IWhitdl2004l ; IZhan fc Kno: 



l200d : iRudd et al.ll2008l ). They could also arise from un- 
certainties in the lensing measurement. An incomplete list 



Weak lensing surveys are rich in physics and con- 
tain information beyond the cosmic shear power spectrum 
|Bernsteinl [20091 : IZhand |2008l ). This bonus allows for self- 
calibration of weak lensing systematic errors, such as the 
galax y intrinsic alignment (jZhandfeoOSl ; Ijoachimi fc~B ridlc 
2009). In the present paper, we utilize the galaxy density- 
shear cross-correlation and density-density correlations in 
photometric survey data to self-calibrate the photo-z scat- 
ters between redshift bins. Namely, for a given photo-z bin, 
we want to figure out (reconstruct) the fraction of galax- 
ies which are actually located in a distinct redshift bin. 
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These scatters quantitatively describe the photo-z outliers 
or catastrophic errors. The effect of photo-z outliers on cos- 
mological inf erence from the shear - shear power spectrum is 
discussed bv lBernstein fc Hutererl (|2009l ). which also quan- 
tifies the task of calibrating these outliers by direct spec- 
troscopic sampling of the galaxy population. Since spectro- 
scopic sampling of faint galaxies at > 99% completeness is 
an expens ive or infe a sible task for current ground-based ca- 
pabilities, [Newman (2008!) proposes a technique based on 
cross-correlation between the photo-z sample and an incom- 
plete spectro-z sample. 

Here we ask a complementary question: how well can 
the outlier rate be determined using purely photometric 
data from the origina l lensing survey? As pointed out by 
ISchneider et al.l (|2006t ). the spurious cross correlation be- 
tween the galaxy density in two photo-z bins can be ex- 
plored to calibrate photo-z errors. They found that, for a 
photo-z bin of the size Az — 0.5 in a LSST-like survey, 1% 
level scatters can be identified and the mean redshift can be 
calibrated within the accuracy of ~ 0.01. This result is im- 
pressive. Unfortunately, a factor of 10 improvement is still 
required to meet the statistic al accuracy of those ambitious 
"stage IV" projects (~ 10~ 3 . iHuterer et alJ [2006). In com- 
bination with baryon acoustic oscillation and weak lensing 
measuremen ts, the constraints can be significantly improved 
(Zhan 2006). However, these procedures adopt a number of 
priors/parameterizations, which may bias the calibration. 

A successful self-calibration should not correlate cos- 
mological uncertainties with astrophysical uncertainties (in 
our case, photo-z errors). To meet this requirement, the self- 
calibration should adopt as few cosmological priors as pos- 
sible, preferably none. On the other hand, it should not re- 
sult in loss of cosmological information. In this paper, we 
propose to combine the galaxy-galaxy clustering measure- 
ment and shear-galaxy cross correlation measurement to 
perform the photo-z self-calibration, strictly reserving the 
shear-shear measurement for cosmology. Finally, it must be 
able to reach sufficiently high statistical accuracy and have 
controllable systematics, if any. As we will show, this self- 
calibration meets all the three requirements and is able to 
detect scatters as low as 0.01%. 

There are a number of differences between our 
self-calibration metho d and the method proposed by 
ISchneider et al.l (|2006t ) . (1) The inclusion of shear-galaxy 
cross correlation measurement breaks a severe degeneracy 
in the previous method and thus significantly improves the 
calibration accuracy. (2) We do not adopt any parameteriza- 
tions on the photo-z probability distribution function (PDF) 
and are thus free of possible bias induced by improper pa- 
rameterizations. (3) This method is a true self-calibration, in 
the sense that the photo-z scatters are reconstructed solely 
from the given weak lensing surveys, no external measure- 
ments nor priors on cosmology and galaxy bias, are neededQ 
(4) We argue that the shot noise is the only relevant noise 
term for the likelihood analysis. Sample variance and non- 
Gaussianity, do not affect the reconstruction. This allows 
us to go deeply into the nonlinear regime, gain many more 



1 Of course, the calibration accuracy depends on the fiducial 
photo-z PDF or the actual photo-z PDF in the given survey. 



independent modes for the reconstruction, and significantly 
improve the reconstruction accuracy. 

Like most of these predecessor papers, we analyze 
an idealized survey: apparent density fluctuations induced 
by gravitational lensing are ignored; galaxy biasing is as- 
sumed to be common to all galaxies at a given red- 
shift; and shear measurement errors are ignored. Incor- 
poration of these and other effects can substantially de- 
grade the correlation-base d photo-z calibration methods 
l)Bernstein fc Hutererl [20091 ) . A framework for comprehen- 
sive analysis of photo metric+sp e ctrosc opic lensing survey 
data is presented in iBernsteinl (|2009l ). and is necessary 
to make a final judgment on the efficacy of photo-z self- 
calibration. However this complicated analysis has not yet 
been applied to the problem of photo-z outlier calibration. 
The simpler analysis presented here will demonstrate that 
the galaxy-shear and galaxy-galaxy correlations contain suf- 
ficient information to measure the outlier rate to useful pre- 
cision, and we will then examine the possibility of degrada- 
tion by the non-ideal effects. 

This paper is organized in the following way. In $2] we 
describe our self-calibration technique and target a fiducial 
"Stage IV" lensing survey for the error forecast. We discuss 
in ij3] possible systematic errors which can be incorporated 
into our technique and will not bias the photo-z PDF recon- 
struction. Other systematic errors cannot be self-calibrated 
without strong priors or external information. For these, we 
quantify the induced bias in Sg] We further discuss uncer- 
tainties in the error forecast due to uncertainties in the fidu- 
cial model and the robustness of our self-calibration tech- 
nique (©. We discuss possibilities to improve the calibra- 
tion accuracy (§6}. We also include two appendices f€Al fc 
[Bjl for technical details of the Fisher matrix analysis and 
bias estimation. 



2 PHOTO-Z SELF-CALIBRATION 

We first define several key notations used throughout the 
paper. 

• The superscript "P" denotes the property in the photo- 
z bin. 

• The superscript "R" denotes the corresponding prop- 
erty in the true-z bin. 

• The capital "G" denotes gravitational lensing, to be 
more specific, the lensing convergence converted from the 
more direct observable cosmic shear. 

• The little "g" denotes galaxy number density (or over- 
density) . 

We split galaxies into N z photo-z bins. The j-th photo- 
z bin has the range [zj — Azj/2, Zj + Azj/2). Our notation 
is that larger i means higher photo-z. Nj, the total number 
of galaxies in the j-th photo-z bin, is an observable. We 
also have N z true-z bins, with the choice of redshift range 
identical to that of the photo-z bins. We denote Ni->j as the 
total number of galaxies in the j-th photo-z bin which also 
belong to the i-th true-z bin, namely, whose true redshift fall 
in [zi — Azi/2, Zi + Azi/2). The process i — >■ j is similar to 
scatters (transitions) between different quantum states. So 
we often call this process as the photo-z scatter. The scatter 
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rate is related to the the photo-z probability distribution 
function, p(z\z p ), by the following relation 



n(z P )dz P 



p(z\z )dz 



(1) 



Here, n(z p )dz p is the number distribution of galaxies in 



the photo-z space. We define pi 



Ni->j/Nj, which repre- 



sents the averaged p(z\z p ) over the relevant redshift ranges, 
or the binned photo-z PDF. In the limit that Azij —¥ 0, 
Pi^j -> p(zi\z p ). Since J2iP^i = we nave ~ 1) 

independent These pi->j together represent a non- 

parametric description of the photo-z PDF p(z\z p ) and com- 
pletely describe the scattering probabilities between redshift 
bi ns, namely the rate of pho to-z outliers, or leakage rate. 
In lBernstein fc Hutererl (|2009h . they are called "contamina- 
tion" coefficients. 

Scatters in photo-z, especially catastrophic photo-z er- 
rors, cause a number of spurious correlations between photo- 
z bins. Our proposal is to reconstruct all Pi->j from these 
cross correlations. For convenience, we will work on the cor- 
responding cross power spectra throughout the paper, in- 
stead of the cross correlation functions. 

We use fiducial power spectra, fiducial leakage Pi->j and 
the survey specification to perform the error forecast. To 
generate the fiducial power spectra, we adopt a flat ACDM 



cosmology with Q„ 



0.27, Qa = 1 - On, fit = 0.044, 



as = 0.84 and h = 0.71. The transfer functio n is obtained 
using CMBFAST jSeliak fc Zaldarriagal [l996h . The nonlin- 
ear matte r power spec t rum is calculated using the fitting 
formula of lSmith et al.l ((2003). Unless specified, we adopt a 
fiducial galaxy bias b g = 1. We are then able to calculate 
the galaxy power spectra and lensing-galaxy power spectra. 
The calculation of these power spectra is by no mean pre- 
cise, due to the simplification of b g = 1. However, this simple 
bias model suffices for the purpose of this paper, namely, to 
demonstrate the feasibility of our self-calibration technique. 
We will further investigate the impact of scale dependent 
bias in fj5l where we find that our self-calibration technique 
is also applicable. 

We also need the galaxy distribution n(z p ) and the in- 
put of fiducial Pi^j. Due to uncertainties in survey speci- 
fications, we will not focus on any specific survey. Instead, 
we will target a fiducial lensing survey with some charac- 
teristics of "Stage IV" lensing surveys like LSStH Euclicfl 
and JDEM. The analysis can be redone straightforwardly to 
incorporate changes i n the survey specifica tions. We follow 
iHuterer et all l|200fjh : IZhan fc Knox] (|2006t ) and adopt the 
photo-z distribution n(z p )dz p = x 2 exp(— x)dx/2, where 
x = z p /zq. For such a distribution, the median redshift is 
z m — 2.675zo. We adopt zq = 0.45, the mean galaxy surface 
density n g — 40 per arcmin 2 , and the fractional sky cover- 
age / s ky = 0.5. The rms dispersion in the shear measure- 
ment induced by the galaxy intrinsic ellipticities is adopted 
as 7 rms = 0.2. 

The fiducial p(z\z p ) and Pi^j are c alcula ted from the 
simulated data of iBernstein fc Hutererl (|2009l ). which were 




Figure 1. The fiducial z -z scatters adopted fo r the fore- 
cast. The simulated data feernstein fc Hut crcr 2009) has 177210 
galaxies. To control the size of the figure, we only show 2% of 
them, randomly chosen Although there are indeed 0.7% galaxies 
at z p > 4, in the analysis we have taken the freedom to disre- 
gard these galaxies. To make the analysis closed, we shall adopt 
the approximation that no galaxies with z p < 4.0 comes from 
z s > 4.0. We check that galaxies with z s > 4.0 only account 
for a tiny fraction of the whole z p < 4 sample (0.06%) and even 
smaller fraction for low photo-z bins ( and thus not showing up 
in this figure since we only plot 2% of all galaxies). So this ap- 
proximation is sufficiently accurate for the purpose of this paper. 



produced using the method described in IJouvel et af] f2009). 
The z p -z s distribution in this data is shown in Fig.[T] where 
z s is the spectroscopic redshift, which can be approximated 
as the true redshift. This simulated data is for SNAP-like 
space weak lensing surveys, conducted in 8 broad bands 
spanning 0.38-1.7/xm wavelength. These surveys have near- 
IR bands and hence likely smaller photo-z errors than LSST. 
Since we find that the statistical accuracy of the photo-z re- 
construction is not very sensitive to the choice of fiducial 
photo-z PDF and since photo-z calibration has room to im- 
prove, the adopted photo-z PDF should be a good represen- 
tative case to illustrate our method. 

Furthermore, the accuracy of the self-calibration is 
sensitive to the choice of redshift bins. Unless other- 
wise specified, we will adopt eight redshift bins, [0.0,0.5), 
[0.5,1.0), [1.0,1.5), [1.5,2.0), [2.0,2.5), [2.5,3.0), [3.0,3.5) 
and [3.5,4.0). For comparison, we also investigate the case 
of much finer redshift bins, [0.0,0.1), [0.1,0.2), . . ., [1.7,1.9), 
[1.9, 2.2), [2, 2, 2.6), [2.6, 3.0), [3.0, 4.0). We call these coarse 
bins and fine bins, respectively. We have utilized the free- 
dom in the data analysis and disregarded the galaxies with 
z p > 4.0. These galaxies only account for 0.7% of the to- 



2 In general, Pi->j and Pj->i are independent and pj _» 7 - ^ Pj- *i- 

3 Large Synoptic Survey Telescope, http://www.lsst.org/lsst 

4 http://www.astro.ljmu.ac.uk/ airs2008/docs/euclid_astronet%202.pdHoss of information. On the other hand, these z p > 4 galaxies 



tal galaxies. Neglecting them does not result in significant 
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may have large photo-z catastrophic errors (Fig.[T]). Neglect- 
ing them helps to reduce systematic errors. 

To close the scattering process, we need to assume that 
no galaxies with z p < 4.0 come from z s > 4.0. Under this 
apssumption, what our self-calibration technique actually 
does, is to assign those z s > 4 galaxies into the [3.5, 4.0) 
true-z bin, instead of randomly assigning them elsewhere(j 
We argue that this approximation is sufficiently accurate, for 



two reasons. First, in the simulated z 



s 



data we used, the 



total fraction of galaxies with z p < 4.0 and z s > 4.0 is only 
0.06%. Second, about 90% of them leak into the [3.5, 4.0) 
photo-z bin. Although the contamination rate in this redshift 
bin is high (4.8%), the induced error in lensing modelling is 
small, since the lensing weighting kernel varies slowly at z ~ 
4. Contaminations to other photo-z bins are much smaller. 
These z s > 4 galaxies account for 0.01% of total galaxies in 
the [0.0, 0.5) photo-z bin, 0.02% in the [0.5, 1.0) and [3.0, 3.5) 
photo-z bins. In the adopted simulated data, we detect no 
such galaxies in photo-z bins 6 [1.0, 3.0). Due to these tiny 
fractions, mis-assignment of these galaxies is not a limiting 
factor of our self-calibration technique. 
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2.1 Galaxy-galaxy clustering 

Photo-z errors, especially catastrophic errors, induce non- 
zero galaxy-galaxy correlation between different redshift 
bins, which should vanish at sufficiently small scale, where 
the Limber approximation holds. This set of correlations 
has been explored to perform the photo-z self-calibration 
jSchneider et al.ll2006h . With the presence of photo-z scat- 
ters, the measured galaxy surface density in a given photo-z 
bin is the combination of the corresponding ones in the true 
redshift bins, 

= (2) 

fc 

The galaxy power spectrum between the i-th and j-th photo- 
z bins is 

^it ~ 'y^,P k ^ i P m ~>-^km R — ^Jpfc->»Pfe->j7^'fcfc' Jt ■ (3) 

km k 

The last equation has approximated = 0, which holds 

under the Limber approximation. At sufficiently large angu- 
lar scales, the Limber approximation fails and the intrinsic 
cross correlation Cjffi m 7^ 0. For this reason, we exclude the 
modes with £ < 100. We will further discuss this issue later 
in the paper ( H3.3|l . 

If the relevant photo-z scatters are sufficiently large, 
C>iS'f will dominate the associated shot noise. In this case, 
the relevant photo-z scatters become detectable. For the ad- 
jacent bins — j\ = 1), the dominant contribution to C?^' P 
obviously comes from Pi->j and Pj-n, unless the correspond- 
ing Pi->j and are tiny, which is unlikely. For correlations 
between non-adjacent photo-z bins, this may not be true. In 
Fig. [2] we show the result of C||' P . In this case, the dom- 
inant contribution comes from Cff'^, through the scatters 



Figure 2. Diagnostics of photo-z errors. Scatters between redshift 
bins caused by photo-z errors induce non-zero correlations C?J'. P 

and C?.*' p ', which otherwise vanish. Here i,j = 1,2... denote 
different redshift bins, the larger the the higher the redshift. 
These spurious cross correlations serve as diagnostics of photo-z 
scatters. We show the resulting C||' P and C^ 9 '^. The errorbars 
shown are from the shot noise. As explained in the text, this is 
the only relevant source of error. 

3 — > 2 and 3 — > 4. Due to the huge sky coverage of the fidu- 
cial stage IV lensing survey, even though the signal Cf|' P 
is suppressed by a huge factor (~ p3->2 X P3->4) relative to 
Cff ! h still overwhelms the shot noise and becomes ob- 
servable. 

However, an intrinsic degeneracy encoded in the galaxy- 
galaxy clustering significantly limits the accuracy of this ap- 
proach. From Eq. [31 it is clear that the correlation C 9 3' P 
can be induced by a nonzero Pi->j or by a non-zero Pj^i. In 
other words, C??' (£) (i 7^ j) only measures the combina- 
tion pj^iPj^jCjj' R (£) + pi^iPi^jCff' 11 (£) . Thus with mea- 
surement at a single multipole £ bin, the galaxy clustering 
measurement alone can not break this degeneracy between 
up and down scatters. Adding more £ bins can break this 
degeneracy — if the ratio C^' R /Cff' R varies with I. If for 
example the 3D galaxy power spectrum is a strict power-law 
over the relevant scales at the relevant redshift range, whose 
power index does not vary with redshift0 then all Cf? {£) 
are self-similar and the solution remains degenerate. Obser- 
vationally, departures from a powe r law have been fo und 
in the galaxy correlation function l|Zehavi et al.l [20041 ). so 
galaxy clustering measurements at many £ bins can break 
the above degeneracy. Nevertheless, galaxy correlation func- 
tions are observed to be close to power laws. This degrades 



5 These galaxies do not lens other galaxies so they have to be 
put in the highest redshift bin. More details on how the self- 
calibration technique rank galaxies are given in £12.31 



6 This condition is necessary. If the power index of the 3D power 
spectrum varies with redshift, the shape of CfP (£) will vary with 
redshift. 
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the reconstruction accuracy. Furthermore, the slope depends 
weakly on galaxy type. If one uses the slight deviations from 
a power law, small changes in the slope of the leaking pop- 
ulation could lead to large systematic reconstruction errors. 
This is an example of the galaxy distribution bias that we 
will scrutinize in We will further investigate this issue 
in 33 

In the next section, we will show that, due to the unique 
geometry dependence of gravitational lensing, the degener- 
acy between up and down scatters is broken naturally by 
adding the lensing-galaxy correlation, resulting in significant 
improvement in the reconstruction. 

2.2 Lensing-galaxy correlations 

Galaxy-galaxy lensing brings N% new observables for each 
angular scale, which could be used to break the previous 
degeneracy. The scatter can render the otherwise vanish- 
ing foreground lensing-background galaxy cross correlation 
non-zero (Cf^ P 7^ 0). More importantly, lensing, due to its 
geometry dependence, distinguishes up-scatters from down- 
scatters. This new piece of information is the key to signifi- 
cantly improve the photo-z self-calibration. 

Without loss of generality, we will work on the lensing 
convergence k, instead of the more direct observable cosmic 
shear 7, which are locally equivalent in Fourier spaceQ With 
the presence of photo-z scatters, the measured lensing con- 
vergence in a given photo-z bin is some linear combination of 
the ones in true-z bins weighted by the scatter probability, 

«f = ^Jpfc-fiK* ■ (4) 

k 

The cross correlation power spectrum between the lensing 
convergence in the i-th photo-z bin and the galaxy number 
density in the j-th photo-z bin is given by 

c ?/ ,p = £^^<-«. ( 5 ) 

In the absence of lensing magnification bias, C^' R 7^ 
only when the source redshifts are higher than the galaxy 
redshifts (namely, k ^ m). Discussion of magnification bias 
and other errors will be postponed to ij3]and Sj4] 

We show in Fig. [2] the case of C^j s,P as an example. It 
is mainly contributed by C^ 9,ii through the scatters 3 — > 2 
and 2 — > 3, by C^f' fi from the scatters 2 — > 3 and by G 3 ^' R 
from the scattering 3->2. Given the strength of P3->2 and 
P2->3, the resulting C^ 9,P is sufficiently large to overwhelm 
the shot noise at £ < 10 4 . We notice that, depending on 
the values of the relevant p and the size of the redshift bin, 
the dominant contribution can come from the C°J? R term, 
despite the heavy suppression in its amplitude due to the 
low amplitude of the lensing kernel over the relevant redshift 
range. 

In most of the cases, can not be measured with 

comparable accuracy to that of C 33 ' p (however, refer to 

7 Cosmic shear measurement directly measures the reduced shear 
7/(1 — k). So the above statement only holds at first order ap- 
proximation. However, this complexity does not affect our self- 
calibration, in which we do not rely on a theory to predict 7 or 
K. 



Fig. [2] for one exception) . However, we point out that it is 
valuable to include this piece of information in the photo-z 
self-calibration. It turns out to be the key to breaking the 
strong degeneracy between up and down scatters. Look at 
the configuration i ^ k > j. The scatter i — > j contributes 
to C G k 3 ' P , while the scatter j — > i does not. For this rea- 
son, it can break the degeneracy between i — >• j and j — > i, 
encountered in the self-calibration based on galaxy cluster- 
ing alone. The discriminating power relies on the intrinsic 
asymmetry between up and down scatters in generating the 
lensing effect. So it remains efficient, even in the case of self- 
similar C 33 {£) , where the self-calibration based on galaxy 
clustering alone blows up (*j5j- 



2.3 The photo-z self-calibration combining the 
galaxy-galaxy and lensing-galaxy 
measurements 

Counting degrees of freedom suggests that we should be able 
to perform a rather model-independent self-calibration with- 
out any priors. Mathematically, we need to solve Eq. 
for all Pj-n, C 3 1 ,R and C^J 11 simultaneously. At the begin- 
ning of this process, C"' p and C Ga ' p should be replaced 
with the corresponding measurements. The reconstructed 
power spectra C 33 ' R and C Gg ' R contain valuable informa- 
tion on cosmology and can be further explored. The recon- 
structed Pi->j can then be applied to the shear-shear corre- 
lation measurement to correct for bias induced by photo-z 
scatters and infer the correct cosmology. In the present pa- 
per, we will focus on only pi->j. Thus we treat C"' p and 
c a a ,p 

as nuisance parameters to be marginalized over. 
The unknown parameters to be determined simul- 
taneously by our self-calibration technique are A = 
{p^^G^ii^^ll^ih)...), with fi ^ v, i > j and 
[l, v,i,j,k = l,..., N z . For Ni multipole I bins and N z red- 
shift bins, we have N Z (N Z + 3)Ni/2 + N Z (N Z - 1) quanti- 
ties to solve {N e N z for C 9 k 9 k ' R , N e N z {N z + l)/2 from C^' R 
(k ^ m) and N Z (N Z — 1) for pfc_> m ). On the other hand, we 
have NeN z (3N z + l)/2 independent measurements of corre- 
lations. N t N 2 z of them come from C Gb ' p and N e N z (N z +l)/2 
from Cf|f 

The equations to solve are quadratic in Pi->j and linear 
in C\j (Eq. [3] & [5}. For this reason, to guarantee a unique 
solution for pt-yj, the number of measurements should be 
at least N Z (N Z — 1) larger than the number of unknowns. 
This condition is satisfied when Ne ^ 2. If all the equations 
( Eq. [3] & [5} are independent then Ne = 2 is the minimum 
requirement. If some of the equations are linear combina- 
tions of the others, we will need ATg > 2. For example, if the 
galaxy power spectra are strict power laws in £, then even 
perfect measurements of C"(£) at all £ and redshift bins 
cannot break the degeneracies in p and the self-calibration 
fails. In reality, none of the galaxy power spectra and shear- 
galaxy power spectra is strictly power law and we expect 
a valid self-calibration barring another unforeseen degener- 
acy. Furthermore, the baryon oscillations leave features in 



The measurement Cff' P is identical to the measurement 
*gg,P 
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C" an d C Gg , which help to improve the reconstruction ac- 
curacy (|Zhan fc Knoxll2006h . 

To better understand the self-calibration process, we 
recast it as a mathematical problem to assign galaxies with 
photo-z labels into correct subsets (true-z bins), based on the 
cross correlation measurements. The condition C 9 3' R = 
tells us that these subsets do not overlap with each other in 
true redshift. The condition C G ? ,R = sets the correct order 
of these true-z bins, ranking from low to high redshifts, since 
only galaxies behind a lens can be lensed. Furthermore, the 
measured power spectra have different dependences on C R 
and Pi->-j (linearly on C R while quadratically on Pi->j). This 
implies the possibility to separate p from C R . 

However, careful readers may have already noticed a 
puzzling behavior. Eq.[3]&[5]are invariant under the scaling 



0.01 ? 



C 



Gg,R 
krn 



£ — 1 

fkfmC k 



(6) 



Gg.R 



where fk are some arbitrary constants. This seems to imply 
that we are not able to determine pi->j without knowing 
C" M and C Gg ' R . However, this is not a real degeneracy. 
The reason is that we have the conditions ^2 k Pk^i = 1. 
These N z constraints uniquely fix the N z freedom of fk- In 
the Fisher matrix analysis carried out in the next section, 
we enforce the conditions E fc pfc-M = 1 by explictly setting 

Pi->< = 1 - E fc ^Pfc^- 

The above solution to this puzzling problem also leads 
to the solution of another question. Can we choose more 
true-z bins than photo-z bins in the self-calibration tech- 
nique? The answer is no. In such case, the number of un- 
known constants fk is larger than the number of constraints 
^2 k Pk^i = 1. Thus we are not able to uniquely fix fk and 
thus Pk^i- 

The absence of degeneracy in the self-calibration is, in 
the end, confirmed by the stability of our Fisher matrix 
inversion (below) for all the configurations that we have 
checked, including both the coarse bins and fine bins, various 
n(z p ) and p(z|z P ).0 

We thus believe that our photo-z self-calibration does 
work. It does not rely on any assumption of the underly- 
ing photo-z PDF. Furthermore, it does not rely on cosmo- 
logical priors, since all cosmology-dependent quantities (e.g. 
the galaxy-galaxy and lensing-galaxy power spectra) are self- 
calibrated simultaneously. So the reconstructed p is indepen- 
dent of uncertainties in cosmology. In the next section, we 
will quantify the reconstruction error and show that the pro- 
posed self-calibration is indeed powerful. For these reason, it 
can be and should be applied to ongoing and proposed weak 
lensing surveys such as CFHTLEo, DESlEI, LSST, JDEM 
and Euclid. 
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Figure 3. The forecasted error in Pi— >j for the fiducial stage IV 
lensing survey. The labels in the plots are the photo-z bins. The 
horizontal axis is the true redshift. The filled circles are the results 
including both galaxy-galaxy correlations and shear-galaxy corre- 
lations. The open squares only use information in galaxy-galaxy 
correlations. We only show the cases with i ^ j, since pi—,i is not 
independent. The horizontal coordinate is z;, the middle point of 
the i-th photo-z bin, instead of the averaged true redshift (zj). 



Before quantifying the reconstruction accuracy, we want 
to address a fundamental limitation of this self-calibration 
technique. It is designed to diagnose scatters between redshift 
bins. It is thus completely blind to photo-z errors which do 
not cause such scatter. Any one-to-one mapping between 
photo-z and true-z preserves C?3' P = and cannot be 
discriminated using galaxy-galaxy correlations. Some such 
photo-z errors can cause Cfj?- P 7^ 0, so we still have some 
discriminating power left. If, however, the mapping between 
photo-z and true-z is monotonically increasing, then we have 
GfS'f = and Cf^ P = 0, and our self-calibration technique 
completely lacks the capability to detect such a photo-z er- 
ror. One simple example is z p = (I + e)z, where e is a 
constant. We must rely on spectroscopic redshift measure- 
ments to diagnose such errors. In other words, our method 
can determine only the scattering matrix of photo-z's, and 
is insensitive to any recalibration of the mean photo-z's. In 
this paper, we assume no such mean photo-z error exists. 
This is certainly a crucial point of further investigation. 



9 To be more strict, the stability of the Fisher matrix inversion 
means that the solution is a local maximum in the likelihood 
space, since the Fisher matrix is based on the Taylor expansion 
around the given solution. For the solution to be unique, in prin- 
ciple we have to go through the whole likelihood space and prove 
that it is the global maximum. This work is beyond the scope of 
this paper. 

10 http://www.cfht.hawaii.edu/Science/CFHLS/ 

11 https: / /www. darkenergysurvey.org/ 



2.4 Error estimation 

We derive the likelihood function and adopt the Fisher 
matrix formalism to estimate the capability of our self- 
calibration technique. The details are presented in the ap- 
pendices [S] & [B] We want to highlight that the error esti- 
mation here is distinctly different from that in routine ex- 
ercises of cosmologica l parameter constraints and that in 
ISchneider et al.l (|2006h . In these cases, the theory predicts 
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Figure 4. The error in the true mean redshift of each photo-z 
bin for the fiducial stage IV lensing survey. The filled circles are 
the results including both galaxy-galaxy correlations and shear- 
galaxy correlations. The open squares only use information in 
galaxy-galaxy correlations, where we can see the significant im- 
provement by adding the shear-galaxy measurements. The results 
are plotted against the middle-point of the corresponding photo-z 
bin. We caution that the galaxy number density weighted true-z, 
(zi), can differ significantly from the middle-point of the corre- 
sponding photo-z bin, mainly due to significant fractions of scat- 
ter into distant redshift bins. For example, for the photo-z bin 
[3.5,4.0), (z 8 ) = 2.97, for [3.0,3.5), (z 7 ) = 2.7 and for [0.0,0.5), 
(zi) = 0.426. 



the ensemble average power spectra, which are then com- 
pared to the data. An inevitable consequence of any power- 
spectrum determination is uncertainty due to cosmic (sam- 
ple) variance. But in our self-calibration, the cosmic variance 
does not work in this way, because the C R are fitted pa- 
rameters, not theoretical predictions. What enters into the 
key equations [3] and [5] is not the ensemble average of the 
power spectra, but the actual values in the observed cosmic 
volume, i.e. they are the sums of their ensemble averages 
and cosmic variances within the observed volume. Galax- 
ies in the same true-z bin (but different photo-z bins) share 
the same cosmic volume, thus their power spectra share the 
same sample variance (however, see tj4.ll for complexities) , 
as do their cross power spectr a with the m atter. Such coher- 
ence has been pointed out by IPenl (|2004l ) and h as been ap - 
plied to improve the weak lensing measurement (|Penll2004T ) 
and primordial non-Gau ssianity meas urement through two- 
point galaxy clustering l|Seliakl [20091 '). Furthermore, we do 
not rely on a cosmological theory to predict these power 
spectra. Instead, our self-calibration reconstructs the actual 
power spectra in the observed survey volume. For this rea- 



Figure 5. The dependence of reconstruction error on the size 
of redshift bins. The filled circles are the results showing in the 
previous figure adopting coarse redshift bins and the open circles 
adopt finer redshift bins. Changes in redshift bin size not only 
change the level of noise and the strength of signal, but also the 
number of parameters to be fitted. Along with the complicated 
error propagation, we see a quite irregular dependence on the bin 
size. 

son, the only relevant source of noise in writing down the 
likelihood function is the shot noise F^l 

This point is of crucial importance for our error analysis. 
(1) It allows us to derive the likelihood function robustly in 
essentially all I range. It is simply Gaussian, thanks to the 
central limit theorem and the stochasticity of shot noise. 
This is even true for the high-^ regime, where the under- 
lying density fields are highly nonlinear and non-Gaussian, 
but the shot noise remains Gaussian over many indepen- 

12 We also want to address that this does not mean that the 
influence of cosmic variance vanishes magically in the Fisher ma- 
trix error forecast. In fact, it enters the fiducial power spectra, 
since the fiducial ones should be those measured in a given cos- 
mic volume instead of the ensemble average. To carry out the 
Fisher matrix analysis more robustly, we need to generate many 
realizations of the fiducial power spectra, do the Fisher matrix 
analysis, and weigh the error forecast according to the probabil- 
ity of each realization of the fiducial power spectra to find out 
the final answer. The good thing is that the cosmic variance of 
each power spectrum is usually much smaller than the ensem- 
ble average (£Alf s ^ y 2> 1), given the large sky coverage of the 
fiducial stage IV lensing survey. The reconstruction error in our 
self-calibration is not sensitive to such small fluctuations in the 
fiducial power spectra. Thus, we are safe to skip the full process 
and just use the ensemble average as the fiducial power spectra. 
For forecasts of surveys with much smaller sky coverage, such as 
CFHTLS, we may need to go through the full process. Further- 
more, if we want to infer cosmology from the reconstructed power 
spectra, cosmic variance definitely enters. 
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Figure 6. The cross correlation coefficient between errors in 
p. To demonstrate the fine structure, we choose the fine red- 
shift bins (N z = 22 redshift bins in total). Both the horizon- 
tal and vertical axes are Pi->j (i ^ j), which runs with the 
order = (2, 1), (3, 1), . . . , (N z , 1), (1, 2), (3, 2) ... , (N z ,2), . . .. 
There are 462 ps in total. Errors in Pi— »j and Pk—>j are correlated, 
since they could cause some 

CGg.P ^ o. Errors in Pi^j and Pi_>fc 
are correlated too, since they cause C B ^' ^ 0. More fine struc- 
ture can be found by zooming in this figure. 

dent £ modes. (2) Since we do not rely upon any theoretical 
model for the power spectra, we do not need a theory ca- 
pable of predictions at small scales where non-linear and 
baryonic physics are important. (3) For these reasons, we 
do not need to disr egard those £ measure ments in highly 
nonlinear regime, as lSchneider et alj (|2006T ) did. The inclu- 
sion of these measurements significantly improves the p re- 
construction, especially for low redshift bins. This explains 
much of the difference b etween the recon s tructi on errors of 
this paper from that of Sch neider et all (2006), using the 
galaxy clustering measurement alone. The high-£ limit of 
this analysis will ultimately depend upon other limitations 
such as the applicability of the weak-lensing approximation 
at small scales, which can in principle render the adopted 
fiducial power spectra unrealistic and thus the error forecast 
unrealistic. However, in our exercise, we expect and have nu- 
merically confirmed that the contribution of high-£ is highly 
suppressed by the shot noise (as can be seen from Fig. [5J), 
so the reconstruction accuracy is not sensitive to the high-f 
limit. Throughout this paper, the results shown are based 
on the choice of 100 < £ < 10 5 . (4) It significantly simplifies 
the matrix inversion and improves the numerical accuracy. 
For N z ~ 10 and iV; ~ 100, the Fisher matrix to invert is 
several thousand by several thousand. However, a dominant 
portion of this Fisher matrix is block diagonal, due to the 
shot noise feature of the error sources. This allows us to sig- 
nificantly reduce the work on matrix inversion. The detail is 
explained in the appendix. 

We show the error forecast in Figs.[3][4][5]&:[6] For most 
bins at z < 2, p can be reconstructed to accuracy < 0.01, for 
either coarse or fine bins. The improvement by adding the 
shear-galaxy measurement is often better than 30%, up to 
a factor of a few. We can also compress errors in Pi^j into 



a single number cti z \ , the statistical error in the mean true 
redshift of each photo-z bin. a/ z \ by no means captures all 
information of the reconstruction error, but it is a convenient 
reference. The result of a^) f° r coarse bins is shown in Fig. 
[4] and that for fine bins is shown in Fig. [5] For the coarse 
bins, it can reach O(10 -3 ) for z < 2. The improvement by 
adding the lensing- galaxy measurement is a factor of ~ 10 
at z < 2, mainly through the improvement in constraining 
scatters from high redshift bins. a^ z ) for fine bins is larger 
(Fig. [5]l . However, these errors are tightly anti-correlated, as 
can be inferred from Fig. [5] and [6] This is the reason we see 
big improvement when choosing bigger bin size. 

We present an order of magnitude estimation to under- 
stand these numbers of the reconstruction error. For exam- 
ple, many Pi-+j can be determined within the accuracy of 
O(10~ 4 ). We take p2-u as an example. The scatter 2 — > 1 
causes Cf|' P 7^ 0. Ignoring other scatters, the threshold of 
P2->i is roughly set when the accumulated signal-to-noise of 
the Cf|' P measurement is 1, 

Here, a is the associated shot noise power spectrum. We find 
that the threshold p2->i inferred from the above equation is 
even smaller than cr P2 _ >1 , the statistical error in p2-»i (Fig. 
[3}. This is indeed expected. Due to simplifications made in 
the derivation, mainly the neglect of error propagation from 
other pi-tj and dj , the threshold of p2->i obtained from the 
above approximation is certainly a lower limit of f P2 ^ 1 . 

From similar arguments, we also expect that the statis- 
tical errors for Pi->j are large for those high redshift bins (e.g. 
i — 7,8), because the number of galaxies in these high red- 
shift bins is small and thus the shot noise is large. However, 
as explained early, C 33 ' p is not the only source of informa- 
tion for the p reconstruction. C Gg ' p can also play impor- 
tant role. For example, the reconstruction of pi->8 can reach 
an accuracy of 10 -4 . The reason is that the scatter 1 — ¥ 8 
causes Cf 3 ' P 7^ 0. The combined G-g measurements have 
S/N > 1 even for pi->8 = 10~ 4 . The Cf|' P measurement 
also contributes, but since the number of galaxies in the 8- 
th bin is only 1% of total galaxies, the associated shot noise 
is large. So its contribution is overwhelmed by that from 
Cij 9 ' P (j < 8). This explains the factor of 10 improvement 
in the pi->8 reconstruction when adding the G-g measure- 
ments. Finally, we caution that, although we are able to 
qualitatively explain some results of Fig. [3] the error proro- 
gation is complicated and the above estimation only serves 
as a convenient tool to understand Fig. [3] 

Spi^j, the errors in Pi->j, are correlated, and the cor- 
relations show rich structures. To better demonstrate these 
features, we adopt the fine redshift bins, N z = 22 in to- 
tal. We define the cross correlation coefficient between Sp, 

as Hiiiiiaja = (%i ->n $Pi 2 is > / y/ {Sp'i x > (fat ->j 3 > • The 
resulting r is shown in Fig. [6] Strong positive and nega- 
tive correlations exist for errors between many scatters into 
the same photo-z bins (p»->.j and pu^j, regions around the 
diagonal of Fig. |B). These scatters are coupled since both 
reduce Cfj! m ^ (i > m and k > m) or C'^ 9 <j (m > i and 
m > k). Scatters i —¥ j and i —¥ k are coupled, too, since 
they contribute to Cj 9 ' P 7^ 0. This explains some strong 
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(both positive and negative) correlations of the off-diagonal 
elements. 



3 EXTRA SOURCES OF STATISTICAL 
ERRORS 

Our self-calibration technique does not rely on priors on 
cosmology or photo-z distribution. In this sense, it is robust. 
However, there are still several sources of systematic error. 
Some of them, if handled properly, can be rendered into 
statistical errors, without resorting to external information, 
and will not bias our reconstruction of pi->j . We will discuss 
them in this section. The remaining of them can not be 
incorporated into the self-calibration without strong priors 
and will be discussed in [|4] 

We find that galaxy intrinsic alignment ( i|3.1[l . the mag- 
nification and size bias f i]3.2[) and the intrinsic cross correla- 
tion between different galaxy bins (JL3}, can in principle be 
incorporated into our self-calibration technique and thus do 
not bias the p reconstruction. Furthermore, we argue that 
the inclusion of these complexities is unlikely to significantly 
degrade the accuracy of our self-calibration technique. 

3.1 The intrinsic alignment 

Surprisingly, galaxy intrinsic alignments do not bias the 
Pi^i reconstructed through our self-calibration technique for 
Pi->j, although they definitely bias the inferred C Gg values. 
With the presence of the intrinsic alignment I, Eq. [5] be- 
comes 

n Gg,P _ \ r ,G g ,R 
°ij — / , Pk^iPm^j^ km 

k~>m 

+ Y,P^^[ C l 9 k R + C % M ] ■ (8) 

k 

Since we do not make any assumption on C^ k 9,R , our self- 
calibration technique automatically takes the intrinsic align- 
ment into account and measures the sum of C^' R and 
C k 3 k R . Clearly, it does not bias the reconstruction of p. On 
the other hand, it certainly affects the statistical accuracy 
of the reconstruction of p. Unless \C^' R \ S> C? 3 ' 11 , its exis- 
tence does not affect the error forecast significantly. 

If the intrinsic alignment Clf.' R depends upon galaxy 
properties, it is possible that this term will differ for outlier 
galaxies than for those correctly assigned to photo-z bin k. In 
this case a bias in pk^i may result. This behavior is similar 
to the systematics from variation of b g that are discussed in 
more detail in H4.ll 

3.2 Magnification and size bias 

In reality, the measured galaxy distribution is the one lensed 
by foreground matter distribution. The measured galaxy 
over-density then has extra contribution from the lensing. 
Besides the well known magnification bias due to the lens- 
ing magnification on galaxy flux, there is also a size bias 
due to the lensing m agnification on galaxy size l|jainll2002l : 
ISchmidt et al.|[2009h . Both can be incorporated into a func- 
tion g(F,A), determined by the flux F and size A distribu- 



tion of galaxies in the given redshift bin. The lensed galaxy 
over-density then takes the form S g — 5* S g + g{F,A)n. 

The existence of this extra term induces non-vanishing 
G?5_' R and Cfj?j R . If not taken into account, it will certainly 
bias the Pi^j reconstruction. The good thing is, at least in 
principle, the same weak lensing surveys contain the right 
information to correct for this effect. Given a lensing survey, 
we are able to split galaxies into bins of flux and size. Since 
the prefactor g is determined by the flux and size distribu- 
tion and is a measurable quantity, we are able to separate 
its effect from others. Or, alternatively, we can design an 
estimator W(F, A) such that (gW) = 0, averaged over all 
flux and size bins. The price to pay is the statistical accu- 
racy of the correlation measurement. For galaxy clustering, 
the shot noise increases by a factor (W 2 ) (b g ) 2 / {Wb g ) 2 , with 
respect to the clustering signal. Here, b g (F, A) is the galaxy 
bias. Robust modeling of this factor requires information on 
galaxies to high redshifts and faint luminosities. Further- 
more, we need to evaluate the effect of measurement error 
on galaxy flux and size. None of these exercises are trivial, 
so we postpone such studies elsewhere. However, we argue 
qualitatively that the degradation in the statistical accu- 
racy is not likely dramatic. Since b g is always positive and 
g changes sign from the bright end down to the faint end of 
the galaxy luminosity function, we do not expect a large loss 
of statistical accuracy by such weighting. However the suc- 
cess of the self-calibration will depend upon the accuracy of 
meth ods to estimate g{F,A) (see also iBernstein fc Hutererl 
120081) . 

3.3 The intrinsic galaxy cross correlation between 
non-overlapping redshift bins 

Under the Limber approximation, the galaxy cross correla- 
tion between non-overlapping redshift bins vanishes. How- 
ever, the Limber approximation is not 100% accurate. In re- 
ality, there is indeed a non-vanishing intrinsic galaxy cross 
correlation Gf5_ lR ^ 0, especially at l arge scales. As cor- 
rectly pointed out by I Schneider et aU (|2006h . this intrinsic 
cross correlation biases the reconstruction of p. Eq. should 
now be replaced by 

err 1 ' = » •• r - " • X> ->'- v r -: ,; (9) 

k k^m 

Ej~i gg ,R I \ ^ ^ gg ,R 
Pk^iPk^j^ kk + Pk^iPm^i<^ km ■ 

k k — m— ±1 

In the last expression, we only keep the correlation between 
two adjacent redshift bins and neglect the correlations be- 
tween non-adjacent redshift bins (fl\k^m\>i)' This approxi- 
mation should be sufficiently accurate in practical applica- 
tions. Thus even if no photo-z error is presented, there is 
still an intrinsic (real) correlation between two different (es- 
pecially adjacent) redshift bins. If not accounted for, these 
non-zero C k ^ R will be mis-interpreted as a photo-z error 
and thus bias the reconstruction of p. 

We first attempt to quantify C k ?J R . Since we only need 
to evaluate Cj?S m at large scales where it is relevant, we can 
adopt the linear theory to calculate it through the following 
well known formula 

roc ji 

C™' R = J^ A 2 n (k,z = 0)™Q i (k,l)Q j (k,l) , (10) 
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where 



Qi{k,l) 



lzi-Azi/2 



D(z)b g (z)ji(kx)n(z)dz 



Zi+Azi/2 
-Azj/2 



n(z)dz 



(11) 



Unfortunately, since QiQj (i 7^ j) oscillates around zero and 
positive and negative contributions to the integral largely 
cancel, the numerical integration is very sensitive to nu- 
merical errors and is thus highly unstable. Nevertheless, 
through the Monte Carlo numerical integral, we believe that, 
at I 100, the cross power spectrum between redshift bins 
[0.0, 0.5) and [0.5, 1.0) falls below ~ 1% of the geometrical 
mean of the corresponding tw o auto correlation powe r spec- 
tra, confirming the findings of Schneid er et al.l l|2006l ). More 
accurate evaluation of the intrinsic cross correlation may be 
performed in real space, where we can avoid the highly os- 
cillating integrand encountered in the multipole space. This 
issue will be further investigated. 



The bias induced is roughly Spk- 
-199, R ir<9a,R 



r^99>R Iri 



99, R 
kk 



or 

Sp m ^ k ~ Cfcm /C^fnf- Depending on the low £ cut, this 
bias may become comparable to the statistical accuracy of 
self-calibration method. However, there are several possible 
ways to eliminate or reduce this bias. 

One way is to start with the last approximation of Eq. 
treat C^ 3 J R ^ =±1 as free parameters and fit them simul- 
taneously with other parameters. This can eliminate virtu- 
ally all the associated bias, with the expense of larger sta- 
tistical errors. We can easily figure out that there are still 
many more measurements than unknowns, so this remedy 
is doable. Furthermore, the degeneracy between C^ R , 1 



and p is weak. For example, C? 



induced by the scatter- 
-99>« ~ t>° On the other 



~<99,P 

ing pi_>j has the property C?^ /C?f" oc 
hand, the intrinsic cross correlation induced by the devia- 
tion from the Limber approximation decreases quickly with 
I and thus C?3' R /C?f' R decreases quickly with I. These dis- 
tinctive behaviors help to distinguish the intrinsic cross cor- 
relation from the one induced by photo-z errors. The char- 
acteristic behavior of C?3'j /C?f' R allows us to take priors 
which are weak, while still helpful to discriminate between 



C 



99, R 



/C?f' R and p. For example, we can set C^' R 



when 



t > ^Limber and thus reduces the number of extra unknowns. 
Alternatively, we can model Cfjj- /Cf?' R as a power law of 
decreasing power with respect to I. 

The inclusion of the lensing-galaxy cross correlation 
measurement also helps to break the degeneracy between 
C^3' R 7^ and p. The photo-z scatters induce both non- 



zertc^ andCf^ 



On the other hand, the failure of the 



-iGg,R 



7^ 0, since the 



Limber approximation does not cause C iK j 
lensing kernel vanishes. 

We then conclude that the intrinsic galaxy cross corre- 
lation between non-overlapping redshift bins may be non- 
negligible for stage IV lensing surveys. However, our self- 
calibration technique has the capability to take this com- 
plexity into account. Further investigation is required to 
quantify its influence on the self-calibration. 



4 POSSIBLE SYSTEMATICS 

There are some error sources which cannot be incorporated 
into our self-calibration technique without strong priors or 
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Figure 7. The systematic error induced in p by the galaxy dis- 
tribution bias b. Circles: statistical errors. Triangles: systematic 
errors (the absolute values). These are for LSST. For DES and 
CFHTLS, the systematic errors induced by the relative galaxy 
bias is sub-dominant and negligible. The toy model adopt for b 



takes the form bi- 



1 + s(zi — Zj) and we adopt \s\ = 0.1 for 



the result shown here. The induced bias scales as s. 



without external information. Thus they will bias the recon- 
structed photo-z scatters. We discuss the influence of the 
galaxy distribution bias in i|4.1l and the multiplicative error 
bias in cosmic shear measurement H4.2I 



4.1 The galaxy distribution bias 

A crucial assumption in the existing self-calibration tech- 
nique is that those galaxies scattering out of the true red- 
shift bin have the same spatial distribution as those remain 
in the true redshift bin. This implicit assumption can be 
inferred from Eq. [2] By straightforward math, we can find 
that <SF 



'' in this equation is actually 

f h ni(z)8 g A(z)dz 
f h m(z)dz 



(12) 



where the integral is over the fe-th redshift bin, rii(z) — 
J,p(z\z p )n(z F )dz p is the true redshift distribution of the 
i-th photo z bin, and <5 3 ,i is the overdensity for galaxies in 
this photo-z bin. In Eq.[2] there is an implicit approximation 

cS.R :S,R rS,H 
5 M =<Vfc =<V • 

This assumption is likely problematic. Those galaxies 
scattering from true redshift bin k to photo-z bin i could 
have either different redshift distribution (n(z)) or different 
clustering (b g ) or both compared to galaxies correctly iden- 
tified in photo-z bin k. Furthermore, the difference in n(z) 
means that these subcategories of galaxies do not sample 
the cosmic volume with identical weighting (despite sharing 
the same true-z bin), so they do not share exactly the same 
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Figure 8. The systematic error induced in the mean redshift 
by the galaxy relative bias b. Circles: statistical errors. Triangles: 
systematic errors. As a reminder, we adopt Si— >j = 1 + s(zi — Zj) 
and \s\ = 0.1. 




Figure 9. The dependence of systematic error induced in the 
mean redshift by the galaxy relative bias b on the size of redshift 
bins. Filled circles represent statistical errors and open triangles 
denote systematic errors. As a reminder, we adopt 6;->j = 1 + 
s(zi — Zj) and \s\ = 0.1. 



cosmic variance and thus do not have the identical clustering 
pattern. We call all these complexities as the galaxy distri- 
bution bias. 

This galaxy distribution bias certainly biases the Pi->j 
reconstruction. For example, if a subcategory of galaxies 
that did not cluster at all were to scatter, a cross galaxy- 
galaxy correlation would not detect those, and result in the 
incorrect leakage reconstruction. Interestingly, even for this 
extreme case of galaxy distribution bias, the galaxy-lensing 
correlation brings hope. For this subcategory of galaxies that 
scattered but did not cluster at all, other galaxies apparently 
behind of them may still be able to lens them and cause 
a detectable spurious foreground shear-background galaxy 
correlation. This example further demonstrates the gain by 
adding the galaxy-lensing cross correlation measurements in 
the self-calibration. 

Unfortunately, even with the aid of galaxy-lensing corre- 
lation measurement, the self-calibration still fails, if no priors 
are adopted. The galaxy distribution bias has not only a de- 
terministic component, but also a stochastic component, i.e. 
the noise induced by the different cosmic (sample) variance 
realizations for different photo-z bins. We show that, even if 
we can neglect the stochasticity, the degrees of freedom in 
the galaxy distribution bias kill the self-calibration. 

In this limit, the galaxy distribution bias can be 
completely described by the relative bias parameter 
namely the ratio of the bias between those scattered into 
the j-th redshift bin to those remaining in the i-th redshift 
bin, weighted by the difference in n(z). With the presence 
of the deterministic galaxy distribution bias, Eq. [3] and Eq. 
[5] become 



C?f' p ~ y^ff fc -Hfffc-).j6fc-Kftfc^jfffcf , (13) 

k 

C ?/' P = E Pk^iPm^b m ^C^ . (14) 

First of all, we notice a degeneracy in Eq. 1131 of the form 
bi^j x pi-tj (i 7^ j). The same argument helps to break 
the scaling invariance of Eq. [6] does not apply here, sim- 
ply due to many more free parameters involved here. The 
galaxy-lensing correlation measurements do help, since the 
scaling invariance in Eq. 1131 Pi-s-j — > bs-s-j X Pi-*j (i 7^ j), 
does not hold in Eq. 1141 Unfortunately, in general, b(£) are 
scale dependent and there are N(N Z (N Z — 1) of them, which 
nearly triple the number of unknown parameters, making 
the number of unknowns larger than the number of indepen- 
dent measurements and thus ruining the self-calibration. 

In reality, from the origins of the galaxy distribution 
bias, we expect that it is scale dependent and is unlikely de- 
terministic. Thus, we are not able to render it as a statistical 
error. Instead, we will live with it and quantify the induced 
bias in the reconstructed p. 

If we were to neglect this b (namely by assuming b — 
1), the reconstructed p would have a bias Sp ~ (b — l)p. 
To robustly quantify the induced bias in p, we need robust 
measurement or modeling of b, which we lack. To proceed, 
we adopt a toy model, 6i_>j = 1 + s(zi — Zj). The details 
of this calculation are shown in the appendix. The resulting 
bias in p scales as s. For the case of \s\ = 0.1, the result is 
shown in Fig.[7][S]&[IJ] We find that, for the most significant 
bias in p, it indeed satisfies the relation 8p ~ (b — l)p. For 
those p whose value is small, the dominant bias is induced 
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by the propagation from other parameters and thus do not 
follow this relation. 

Depending on the actual amplitude of this galaxy dis- 
tribution bias, this may be the dominant systematic error. 
It may also be non-negligible, or even dominant, comparing 
to the statistical errors in the reconstruction (Fig. [TJ [8] & 
0. There are possible ways to reduce it. By choosing finer 
bin size, we can reduce the galaxy distribution bias caused 
by the difference in n(z), at the expense of more pi-*j and 
bi->j parameters to constrain. With high-quality imaging or 
photometry we could further split galaxies into sub-samples 
of morphological or spectral types to reduce the difference in 
clustering strength. If eventually we can reach |s| < 0.1, the 
galaxy distribution bias will not be catastrophic, but still 
significant (Fig. [7]). 

We caution that this galaxy distribution bias also ex- 
ists in the calibration technique based o n cross correla- 
tions between photo-z and spec-z samples |Newmanl 120081 ; 
iBernstein &c Hutererll2009l ). I n principle, direct spec-z s am- 
pling of the photo-z galaxies jBernstein fc HuterefeOOi^ ) al- 
lows for direct measurement of the galaxy distribution bias 
including its stochasticity. Since the galaxy distribution bias 
has its own sample variance, the spec-z sampling must be 
sufficiently wide in sky coverage, deep in redshift and reach 
high completeness. These are crucial issues for further inves- 
tigation. 



4.2 The multiplicative error bias 

Due to incomplete PSF correction, shear measurement can 
have multiplicative errors and additive errors. The additive 
errors do not bias the self-calibration results, since they do 
not correlate with galaxies. However, the multiplicative er- 
rors, which renders 7 to (1 + /)7, can. If / is the same 
for those galaxies whose photo-z remains in the true-z bin 
and those galaxies scatter out of the true-z bin, it does not 
induce bias in the p reconstruction. However, in principle, 
these galaxies could have different multiplicative error. The 
multiplicative error could for example depend on the size of 
galaxies. If the photo-z error depends on some intrinsic prop- 
erties of galaxies, which correlate with the galaxy size, then 
the multiplicative error would vary across different photo-z 
samples with the same true redshift. In this case, Eq. [5] and 
1141 no longer hold. Eq. [14] should be replaced by 

C % 9,P = P*->iPm-«(l + Af k ^)b m ^C^ R . (15) 

Here, the parameter A/*_h = (1 + f k ^i)/(l + fk^k) - 1 ~ 
— /fc_>fe describes the relative difference in the mul- 
tiplicative errors of the two galaxy samples in the fc-th 
true redshift bin (one scatters to the i-th photo-z bin and 
the other remains in the fe-th photo-z bin). C^' R = (1 + 
fk^k)C^' R . Clearly, if A/ = 0, it does not bias thep recon- 
struction, since we just need to redefine C° 9 ' R . If Af 7^ 0, 
the bias induced in p is 5p ~ Afp. Current shape measure- 
ment algorithms control / to ~ 1 % levels with poten tially 
larger redshift or size dependence (iMassev et al.ll2007l ); but 
cosmic-shear analysis of future large surveys will require 
l/l ^ 0.1% if indu ced systematics are to be subdominant 
to statistical errors ( Hutereretal ] |2006l :lA mara fc Ref rcgicr 
2008). Anticipating future progress in shape measurement 



errors, we can expect an induced error < O.OOlp, which is 
sub-dominant to the one induced by the galaxy distribution 
bias and is thus likely negligible. However, in case that the 
shape measurement errors fail to reach the required accu- 
racy, the bias induced by the relative multiplicative error 
must be taken into account carefully. 



5 DEPENDENCE ON THE FIDUCIAL MODEL 

The error forecast depends on the fiducial model we adopt, 
including the galaxy properties and the survey specifica- 
tions. A thorough analysis over all uncertainties in the fidu- 
cial model is beyond the scope of the current paper. Instead, 
we will present brief discussions on several key issues. 

5.1 Dependence on the galaxy clustering 
properties 

In the above analysis, we have made a number of simplifica- 
tions. (1) We have adopted a scale independent and redshift 
independent galaxy bias for the error forecast. In reality, the 
galaxy bias is both redshift and scale dependent. As we have 
seen, the reconstruction does not require assumptions on the 
actual cosmology or clustering properties, so a variable bias 
is much like changing the background cosmology, which in 
principle is independent of the inferred scattering. (2) All the 
power spectra in Eq. O etc. are the ones in the observed 
cosmic volume. Due to the cosmic variance, they can differ 
from the ensemble averages that we adopt for the fiducial 
power spectra. As explained before, these uncertainties also 
affect the error forecast. Eventually we will apply this self- 
calibration technique to real data and thus will completely 
avoid the ambiguity in the fiducial model. 

Here we will address the impact of scale dependent bias. 
As we have mentioned in [JU the self-calibration with only 
galaxy-galaxy clustering heavily relies on the shape differ- 
ences between different Cf®(t). If the 3D galaxy clustering 
is close to power-law over a large scale range with redshift 
independent power index, the resulting C?f(£) are close to 
self-similar and thus the self-calibration with only galaxy- 
galaxy clustering will degrade significantly. The question is, 
can the full self-calibration, with the aid from galaxy-lensing 
cross correlation measurement, avoid this potential degrada- 
tion? 

To investigate this issue, we adopt an ad-hoc model for 
galaxy clustering, in which the galaxy bias is scale dependent 
such that 3D galaxy power spectrum (variance) takes the 
form 

A 2 g {k,z) = lQ 1 - 25 k 1 - Sb (l + z y 2 when k < k c , (16) 

and A 2 g (k,z) = A 2 g (k c , z) when k > fc c . If k c > 1/i/Mpc, at 
z — and 0.1/i/Mpc < k < 1/i/Mpc, it is close to the matter 
power spectrum (variance) A 2 n (k,z = 0) and thus close to 
the galaxy clustering with b g = 1. But it shows significant 
deviation from the matter power spectrum at other k and 
other redshifts. In the limit that k c —> 00, this galaxy power 
spectrum becoms a strict power-law and we expect that the 
self-calibration based on galaxy clustering alone fails. Nu- 
merically, we find that when k c > 7/i/Mpc, the Fisher ma- 
trix inversion based on galaxy clustering alone blows up, 
indicating its failure. 
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Figure 10. The dependence of self-calibration performance on 
the shape of galaxy power spectrum. We adopt a toy model of 
galaxy clustering, characterized as a power law with a break at 
k = k c . The shown value of k c is in unit of h/Mpc. The ad-hoc 
galaxy power spectrum is closer to a power law for larger k c . As 
expected, when the galaxy power spectrum is closer to a power 
law (with redshift-indepcndcnt power index), the self-calibration 
based on galaxy clustering alone degrades and eventually blows 
up, due to more severe degeneracy between up and down photo- 
z scatters. On the other hand, the full self-calibration virtually 
avoids this problem, since it heavily relies on the intrinsic lensing 
geometry dependence to break this degeneracy. 



Fig. 1101 shows the degradation when increasing k c . De- 
spite better galaxy clustering measurement due to stronger 
clustering strength, the self-calibration accuracy degrades, 
since the galaxy power spectrum is closer to a strict power- 
law and the degeneracy between up and down photo-z scat- 
ters becomes more severe. 

We expect that the full self-calibration is basically free 
of this problem, since it mainly relies on the lensing geom- 
etry dependence to break the degeneracy between up and 
down scatters — namely, a lens can only lens a galaxy be- 
hind it. This is indeed what we find numerically. To demon- 
strate this point, we assume the galaxy bias with respect to 
the matter density to be deterministic, thus the 3D matter- 
galaxy cross correlation power spectrum (variance) is given 
by 

A 2 mg (k,z) = VAfOM) x AUk,z) . (17) 

This quantity determines the galaxy-galaxy lensing power 
spectrum. The accuracy of the full self-calibration is shown 
in Fig. [TUJ The reconstruction accuracy only degrades 
slightly even when the galaxy power spectrum is very close 
to a strict power-law (e.g. k c = 6.5/t/Mpc). 

This extreme example confirms our expectation that 
scale dependence of the galaxy bias is unlikely to alter the 



major conclusions of this paper, namely the feasibility of our 
self-calibration technique. 



5.2 Dependence on other specifications 

The scaling of Pi^j reconstruction accuracy with fiducial 
quantities can be roughly understood as follows. The ob- 
served density-density correlations C tx p 2 b 2 A 2 n and the 
shear-density correlations C oc p 2 b g A 2 n , where A„ is the 
matter power spectrum (variance). These are the signals. 
For the noises in the correlation measurements, we have 
shown that shot noises in shear and galaxy number den- 
sity measurements are the only relevant ones, which scale as 
7rms(/sk y ?ig)~ 1/ ' 2 and (f s k y n 2 g )~ 1/2 respectively. 

Then if relying on galaxy-galaxy lensing measurement 
alone, the reconstruction error a p tx j^ ms . In combination 
with galaxy-galaxy clustering measurement, the dependence 
becomes weaker and we expect that a v tx 7r ms , where e £ 
(0,1). If we adopt a fiducial value 7 rms = 0.24 instead of 
0.2, the reconstruction accuracy will degrade by a factor of 
< 20%. 

For fixed galaxy distribution, matter power spectrum 
Aj,, and Pi->j, following similar argument above, we find 
that 

a P K /skJ /2 "-9 ^rms ■ (18) 

Here, the bias dependence d 6 (1,2) and, as a reminder, 
ee (0,1). 

It is interesting to quantify the performance of the self- 
calibration technique for surveys like CFHTLS and DES. 
Based on the above scalings, we are able to do an order of 
magnitude estimation. (1) One of the major differences be- 
tween these surveys and the fiducial stage IV survey, is the 
sky coverage. From the / s k y dependence alone, for CFHTLS, 
we expect a factor of 10 larger reconstruction errors, since 
the sky coverage is a factor of 100 smaller. For DES, we ex- 
pect the degradation to be a factor of 2. Since the statistical 
accuracy of these surveys scales exactly the same way with 
respect to the sky coverage /sky, the self-calibration tech- 
nique works equally fine for these surveys, from this view- 
point. (2) Another difference is that the number density of 
source galaxies in CFHTLS and DES is likely a factor of 
2 smaller. This results in another factor of 2 degradation 
in the reconstruction accuracy. Since the lensing measure- 
ment at £ < 2000 is not completely shot noise dominated, 
the statistical accuracy has weaker dependence on n g . From 
this viewpoint, the self-calibration technique works better 
for surveys with higher galaxy number density. 

We caution that the above estimation neglects many 
complexities. For example, CFHTLS, DES and Pan- 
STARRS are shallower than the fiducial survey and thus 
the galaxy number densities at high redshift in these surveys 
are likely much smaller, while the galaxy number densities 
at low redshift are comparable. This implies that the Pi->j 
reconstruction in these surveys at high-z is more affected 
that at low-z. Furthermore, the above estimation neglects 
the difference in the photo-z error, which is likely consider- 
ably worse for CFHTLS and DES. It will definitely affect 
the reconstruction. However, as we discussed in N2.4I the 
detection threshold of contamination rate (pi->j) is mainly 
determined by the ratio of noise and signal of correspond- 
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ing true-z bins. In this sense, the self-calibration technique 
works better for worse photo-z estimation. 

For a robust forecast of the performance of our self- 
calibration technique in each specific survey, we need a de- 
tail fiducial model of the galaxy distribution, clustering and 
photo-z error distribution. Although this task is beyond the 
scope of this paper, the above estimates imply that the self- 
calibration technique will be applicable. 



6 DISCUSSIONS 

It is possible to further improve the statistical accuracy of 
the self-calibration technique. For example, so far we treat 
Cfyj and C^yj as independent quantities. Improvement can 
be made by utilizing their internal connection. Both of them 
are determined by the same mass-galaxy cross correlation 
over the same redshift range. For this reason, they are con- 
nected by a simple scaling relation fin the absence of in - 
trinsic alignments) , as pointed out by I Jain fc Tavlorl (2003); 
IZhang et all (120051 ): iBernsteinl (|20od ). 

In weak lensing cosmology, people often disregard the 
lensing power spectrum measurement at I > 2000-3000, be- 
cause theoretical prediction at such scale is largely uncer- 
tain. However, the shear-shear measurement at such scales 
contains useful information to improve the photo-z recon- 
struction as well as shear-ratio information that is useful for 
cosmology. For example, they allow for better handle over 
the magnification and size bias induced correlations, mainly 
in the background density-foreground shear correlation mea- 
surement. We could use this information usually disregarded 
in cosmological applications to improve the photo-z calibra- 
tion. 

So far we only focus on the reconstructed Pi->j . The re- 
constructed C??' R and Cf>- R contain valuable information 
on cosmology and can be further explored. Since these re- 
constructed power spectra do not suffer from the problem 
of the photo-z scatters, they can be utilized for purpose be- 
yond cosmology. For example, t he self-calibra tion of galaxy 
intrinsic alignment, proposed bv lZhangl (|2008T ). relies on the 
measurement of Cf i a ' P to infer the GI correlation contami- 
nated the weak lensing measurement. This quantity is con- 
taminated by the photo-z scatters. Since we are now able 
to quantify the photo-z scatters and C? 9,ii simultaneously, 
we are able to quantify and correct for the effect of photo-z 
scatters in the self-calibration of galaxy intrinsic alignment. 

One key issue missing in this paper is to propagate the 
errors or biases on the reconstructed Pi->j into errors on 
cosmology inferred from shear-shear data. This will tell us 
whether the errors in the self-calibration are small enough 
to avoid significant biases or inflated errors in a shear-shear 
cosmology measurement. Although many of the p's are un- 
certain by more tha n the ~ 0.1% that is neede d to make 
the biases negligible (|Bernstein fc Hutererl 12009), there are 
many correlations between these errors which complicate the 
estimation. This issue definitely deserves further investiga- 
tion. 

We emphasize that our purely photometric self- 
calibration technique is complementary to those based 
on c ross correlations between phot o-z and spec-z sam- 
ples (|Newmanll200ll IBernsteinl l2009h . which we call cross- 
calibration. An advantage of cross-calibration is that it can 



identify a special type of photo-z error, namely when the 
mean photo-z is a monotonic increasing function of true-z 
(other than the correct identity function) . As explained ear- 
lier, the self-calibration technique fails completely for such 
type of photo-z error. The cross-calibration technique will 
also likely be more able to infer galaxy distribution bias 
caused when photo-z outliers have a different n(z) than other 
galaxies in the same true-z bin. On the other hand, the self- 
calibration has a number of advantage over the cross corre- 
lation. Since the total number of photo-z galaxies is much 
larger than that of the spec-z sample, it can reach higher 
statistical accuracy. Since what it measures is the photo- 
z scatters in the whole survey volume, it avoids possible 
cosmic variance in the photo-z scatters, which could bias 
the cross-calibration. And since the spec-z targets may be 
a very different population from the photo-z galaxies, the 
cross-correlation method will be more susceptible to biases 
from varying b g among subpopulations. 

The photo-z scatter self-calibration method described 
here has many attractive aspects: it does not depend on any 
cosmological priors or on models of the power spectrum; it is 
unaffected by intrinsic alignments; its errors are determined 
by shot noise, not sample variance, so that higher source 
densities are exploited if observed; it remains Gaussian and 
tractable to high £ ; and is not significantly affected by shear 
measurement errors. And of course it can be conducted with 
the same imaging data used for the shear-shear correlation 
measurement without degrading the shear-shear information 
content. The Fisher analysis suggests that photo-z outlier 
rates can be determined with statistical errors of 0.01-1% for 
bins at z ^ 2. It will be necessary to correct data for lensing 
magnification bias, and in principle this can be done with 
little statistical penalty, but the magnification bias factor 
g must be determined to sufficient accuracy. The biggest 
issue is "galaxy distribution bias," whereby photo-z outlier 
galaxies might have different n(z) or bias b g than the non- 
outlying galaxies in the same true-z bin. We find that the 
effective bias of galaxies must vary by < O(0. 1) in order to 
avoid systematic errors in scattering rates that exceed the 
expected statistical errors. This is an area deserving of more 
detailed attention. 

In this paper, we have presented a concept study of the 
proposed self-calibration. For the idealized survey of stage 
IV survey specifications, we have shown that it can in princi- 
ple reconstruct the photo-z error distribution to useful pre- 
cision. More robust forecast shall include all extra sources 
of error, as listed in i|3]and Sj4] more realistic fiducial model 
and possibly more uncertainties, into account. 



7 ACKNOWLEDGMENT 

We thank Hu Zhan, Sarah Bridle and Jun Pan for many 
useful discussions and the anonymous referee for many use- 
ful suggestions. PJZ thanks the hospitality of the UPenn 
physics and astronomy department and the Aspen center for 
physics, where part of the work was done. PJZ thanks the 
support of the one-hundred talents program of the Chinese 
academy of science, the national science foundation of China 
(grant No. 10533030, 10543004, 10821302 & 10973027), the 
CAS grant KJCX3-SYW-N2 and the 973 program grant 
No. 2007CB815401. GMB acknowledges support from grant 



Photo-z self- calibration 15 



AST-0607667 from the National Science Foundation and De- 
partment of Energy grant DOE-DE-FG02-95ER40893. 



REFERENCES 

Report of Dark Energy Task Force. Andreas Albrecht, et 

al. |arXiv:astro-ph/0609591| 
Amara, A., & Refregier, A. 2008, MNRAS, 391, 228 
Bacon, D. J., Refregier, A. R., & Ellis, R. S. 2000, MNRAS, 

318, 625 

Bernstein, G. 2006, ApJ, 637, 598 

Bernstein, G. M. 2009, ApJ, 695, 652 

Bernstein, G., & Huterer, D. 2009, larXiv:0902.2782l 

Fu, L., et al. 2008, AAP, 479, 9 

Heitmann, K., Ricker, P. M., Warren, M. S., & Habib, S. 

2005, ApJS, 160, 28 

Heitmann, K., White, M., Wagner, C, Habib, S., & Hig- 

don, D. 2008, larXiv:0812.T052l 
Heitmann, K., Higdon, D., White, M., Habib, S., Williams, 

B. J., & Wagner, C. 2009, larXiv:0902.0429l 
Heymans, C, et al. 2006, MNRAS, 368, 1323 
Hirata, C. M., et al. 2004, MNRAS, 353, 529 PRD, 70, 

063526 

Hirata, C. M., Mandelbaum, R., Ishak, M., Seljak, U., 
Nichol, R., Pimbblet, K. A., Ross, N. P., & Wake, D. 2007, 
MNRAS, 381, 1197 

Huterer, D., & Takada, M. 2005, Astroparticle Physics, 23, 
369 

Huterer, D., Takada, M., Bernstein, G., & Jain, B. 2006, 

MNRAS, 366, 101 
Jain, B. 2002, ApJ, 580, L3 

Jain, B., & Taylor, A. 2003, Physical Review Letters, 91, 
141302 

Jain, B., & Zhang, P. 2008, PRD, 78, 063503 

Jing, Y. P., Zhang, P., Lin, W. P., Gao, L., & Springel, V. 

2006, ApJ, 640, L119 

Joachimi, B., & Bridle, S. L. 2009, larXiv:0911. 24541 
Jouvel, S., et al. 2009, larXiv:0902.0625l 

Kaiser, N., Wilson, G ., & Luppino, G. A. 2000, 
|arXiv:astro-ph/0003338l 

Ma, /.. Hu, W., & Huterer, D. 2006, ApJ, 636. 21 
Mandelbaum, R., Hirata, C. M., Ishak, M., Seljak, U., & 

Brinkmann, J. 2006, MNRAS, 367, 611 
Massey, R., et al. 2007, MNRAS, 376, 13 
Newman, J. A. 2008, ApJ, 684, 88 

Okumura, T., Jing, Y. P., & Li, C. 2009, ApJ, 694, 214 
Okumura, T., & Jing, Y. P. 2009, ApJ, 694, L83 
Pen, U.-L. 2004, MNRAS, 350, 1445 

Rudd, D. H., Zentner, A. R., & Kravtsov, A. V. 2008, ApJ, 
672, 19 

Schmidt, F., Rozo, E., Dodelson, S., Hui, L., & Sheldon, 

E. 2009. larXiv:0904.4702l 
Schneider, M., Knox, L., Zhan, H., & Connolly, A. 2006, 

ApJ, 651, 14 
Seljak, U., & Zaldarriaga, M. 1996, ApJ, 469, 437 
Seljak, U. 2009, Physical Review Letters, 102, 021302 
Smith, R. E., et al. 2003, MNRAS, 341, 1311 
Van Waerbeke, L., et al. 2000, AAP, 358, 30 
White, M. 2004, Astroparticle Physics, 22, 211 
Wittman, D. M., Tyson, J. A., Kirkman, D., DelP Antonio, 

I., & Bernstein, G. 2000, Nature, 405, 143 



Zehavi, I., et al. 2004, ApJ, 608, 16 
Zhan, H., & Knox, L. 2004, ApJ, 616, L75 
Zhan, H., & Knox, L. 2006, ApJ, 644, 663 
Zhan, H. 2006, Journal of Cosmology and Astro-Particle 
Physics, 8, 8 

Zhan, H., & Knox, L. 2006, |arXiv:astro-ph/0611159| 
Zhang, J., Hui, L., & Stebbins, A. 2005, ApJ, 635, 806 
Zhang, P. 2008, larXiv:0811.0613l 



16 Zhang, Pen & Bernstein 



APPENDIX A: THE LIKELIHOOD ANALYSIS AND THE FISHER MATRIX LAYOUT 



We work on the likelihood of the band angular power spectra to quantify the self-calibration error. As explained in Sj2] the 
error estimation here is distinctly different from that in the routine exercises of cosmological parameter constraints and the 
only relevant noise term is the shot noise. This makes the error estimation much simpler. Here we recast this argument in a 
more formal way. 

The data we have are a set of measured power spectra contaminated by shot noise, C D = C p + 5C S . Here, the superscript 
"D" denotes the data. SC S is the fluctuation in shot noise. C p are the power spectra in photo-z bins, which are related to 
C R by a set of pt^j through Eq. [3]&[5] We want to know the likelihood function P(pi^j \C D ). Since the distribution of C D 
is completely determined by and C R (given the shot noise distribution), Bayes' theorem states 



P(pi 



We then have 



,C R \C D ) ocP(C D \ Pi 



> ] ,C R )P{p^ J ,C R ) = P{C D \ 



H^,C R )P( Pi ^)P(C R ) 



P(pi 



\c D ) = 



P(p^,C R \C D )dC R 



P(C D \ Pi . 



,C R )P{ Pl . 



^)P(C fl )dC fl 



(Al) 



(A2) 



For Pi-i-j, we only exert the condition '^ li Pi^j = 1- Since we do not take any cosmological prior, we set the prior P(C R ) = 1. 
Then Eq. IA2I means to marginalize over all possible value of C R . This is what we have done in the paper. Notice that 
P(C D \pi->j , C R ) is completely determined by the shot noise distribution, given C R and Pi->j, the parameters that we want to 
fit. That is the reason that we state that only shot noise shows up in the error analysis. And, since we do not have real data, 
we have to arbitrarily choose C D , which could differ from the cosmic mean, due to the existence of cosmic variance. This 
freedom is where the cosmic variance shows up. In the paper, we follow the common choice and set C D = (C R ), the ensemble 
average. But this is just a representative case of the real data. Namely, C D can be any other value in the range allowed by 
the cosmic variance and the shot noise, as discussed in one of the footnotes in i ]2.4l 

In Eq. IA2I we can take stronger prior on C R . In principle, if we know the cosmology, we can write down the full PDF of 
C R . In reality, we do not know exactly the cosmology and we do not know exactly the PDF in the non-linear, non-Gaussian 
regime. Nevertheless, these priors will improve the reconstruction accuracy of Pi->j. However, as we show in the paper, the 
systematical error induced by galaxy distribution bias may have already been the dominant error source, improvement over 
statistical error is not necessary at the current moment. 

Since the fluctuations are induced by the shot noise, the central limit theorem implies that we can approximate the 
probability distribution of the power spectra P(C D \pi^,j, C ) as Gaussian, as long as there are sufficiently large number of 
independent £ modes in each multipole £ bin (£A£f B ] i y 3> I). Since we only use those modes with I > 100 and Al 3> 1, 
the probability distribution of the power spectra should be Gaussian for / s k y > 0.01. Furthermore, the covariance matrix is 
diagonal, namely, different power spectrum measurements are uncorrelated, since the associated shot noises are uncorrelated. 
The covariance matrix between the power spectra C^j- and C^' P , with (a), (/?) = gg, Gg, is 



(«), 03) 



C 



with 



,(°0 



J(a)GS) 



<5n 



t„99\2_ [47r/ sky /iVi] [47r/ sky /Nj] Gg a 

' ~ 2£A£f sky [1 + 0lj) ' ^ ' 

The logarithm of the likelihood function is 



2£A£f Biiy 



InL : 



IE 



E 



'(£) 



r<s9,p 



(Cf/' D (£) 



C 



Gg,P 



+ const. 



(A3) 



(A4) 



(A5) 



Here, Cff' P and C% 9 ' P are the model prediction, given by Eq.[3]&[5l If the model is unbiased, we can set C 
c a g ,D _ c Gg,p whgn evamatmg t h e Fisher matrix F i3 = d 2 In L/dXidXj i. We then have 



gg.D 



c 



gg,P 



and 



E 



E 



dC^ p (£) dCff' p {£) 



gg,P( 



8\r, 



_ 2 dCff> p {l) dC^ p {£) 



dX k 



dX n 



(A6) 



Cf?' P and Cj?' P are equivalent, so we only need to sum up pairs with i ^ j for the galaxy-galaxy correlation. On the other 
hand, for the lensing-galaxy cross correlation, we should sum up all pairs (in this case, ij and ji pairs are asymmetric). The 



unknown parameters to be determined are A = (p M _>„, C^ 9 ' R (£i 



,C 3 k 3 k > R (h 



), with (i v, i ^ j and fx,L/,i,j,k = 1, . 



Here, £1,2... denote the multipole £ bins. The Fisher matrix can be decomposed as follows, 



F = 



A B 
C D 







(A7) 



Photo-z self- calibration 17 



Here, A = F P(J p . It is the sum of A^, the contribution from each £ bin. B = F p c fl2V2 and C = B T . B can 
be decomposed as B = (Be 1 , Bf 2 , . . .), where B^ is the contribution from each £ bin. D = PD wn c w ^. Since different £ 
bins do not correlate, D is block diagonal and denote each block element as D^. The exact definitions of At, B^, Ce and 
T>i can be found by comparing Eq. IA5I and |A7I The dimension of A and A; is N Z (N Z — 1) x N Z (N Z — 1). That of B^ is 
N Z (N Z - 1) x N Z (N Z + 3)/2 and that of D £ is N 2 (A Z + 3)/2 x A 2 (A" Z + 3)/2. The inverse of F is 



/ (A-BD-C)- 1 — (A — BD _1 C) 1 BD 1 \ 

I -D- 1 C(A-BD _1 C) 1 D 1 + D : C (A BD : C) 1 BD" 1 J 1 j 

For convenience, we define 

Reduced _ A _ BD -i c = ^ [A e - BfD^C^] = AT Auccd . (A9) 

t t 

The second expression holds because D is block diagonal. The reduced matrix A roduccd determines the reconstruction accuracy 
of pi-t-j. The last expression simply means that, since £ bins are uncorrelated, we are able to sum over the contribution from 
each of them and improve the reconstruction accuracy. We know that constrains on the photo-z scatters mainly come from 
Cf*f and C^f. So we can further decompose A e = Af in + AA £ , where A™ in is the contribution from Cf°f and Cf^f 

and AAe is the contribution from C^: p and Off.'? . For clarity, we provide the expression of AAi, 

— - E W«r S^l? +E (W . . 

Here, A fc , A m 6 . 

In the limit p M7 ^i/ — > (equivalently p^u — > 5^), we have AA< — > BfD ( T 1 Cf. The proof is as follows, since Pi->j = 5ij, 
D< is diagonal, with the diagonal elements D„g<, „g. = (o- G9 „, )~ 2 and D^ss . n gg = (cr?, s „, ) -2 . We then have 

Thus under the limit that Pi^j — > 0, A rcduccd = A mln and the the error matrix of p is [A™ 11 ] - . This result has a clear physical 
meaning. We know that the measurements Cfl ' p and C^! p are most responsible for determining Cfl' p and Cf> - R and the 

measurements C?!'^ and Cf^ P are most responsible to determine p. Under the limit Pi^j — > 0, we are able to have a clear 
separation of the two sets of measurements. 

We caution that with the presence of photo-z scatters, this is no longer the case. Numerically, we find that the presence 
of photo-z scatters make the statistical error in p reconstruction, larger than the above limiting case. 

We now derive the detailed expression of all the matrices. A useful relation in this exercise is that <9pfc_>i/9p M ^„ = 
5i„(<5fc M — <5fc„). The last term shows up because pi_>, = 1 — X)fc^iP fc - >i an< ^ OIU y Pm#" are independent variables. We then have 



dp^ v 



Si, J2 ( C< ^ R - C ^' R ) + ^ E ( C ^' R - P*-« s foAffi + 6 jv B% ■ (A12) 



ft; 

Clearly, we have A 99 vi — B 99 vi . But there is no such relation for the G-g terms. Finally we have 

(a) 



A™^ 1 _ > „ 1 , p „ a _ > „ a = [(<t£^ 2 ) 2 A { ^ vlV2 B^ 2 l 2V1 H(u2 - vi) + (o-^'J 2 B ( ^ vlU2 A^ V2V1 H{yi - v 2 ) 



2 A {a) A ia) S 4- V frr <a A 2 R (Q) R (a) S 
j>"i i<v\ 



(A13) 



A a . x ( rT (gg)\ 2 v I a (a a) > u(aa) \( A(aa) , d(ss) \ 

+ {^l)~ 2 Af^Bf 2 i> 2V1 H{v, - + (a(f«»)- 2 B^^A^fT^ - ux) 

4- V ( ff (Gs) r 2 4 (G9) /1 (G9) + V frr (G9) V 2 R (Gs) R (G9) fA14^ 
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r .Gg 



Gg^ 

3> 



l A u 



1J'Pm2 
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(A15) 



B, 



99 



E«) 



?Pm2 



(A16) 
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D fC G 3 c Gg = }{off) 2 Pvi^iPvi^jPv2^Pv2^j > ( A17 ) 

and 

Here, //(a;) is the Heaviside step function, whose value is 1 when x is positive, otherwise zero. In the above equations, all the 
power spectra at the right hand sides are evaluated at the corresponding I. 

We now only need to invert the matrix De and A rcduccd . This is a much easier job than the inversion of the full Fisher 
matrix F. More than that, it improves the numerical stability and accuracy of the inversion, which we can check at each step 
according to Eq. lA8l fe lA9l Numerically, we do not find any numerical instability or singularity for all the configurations that 
we have tested. The reconstruction accuracy of p is solely determined by A reduccd and the results are shown in Fig. [3] 

Alternative to this detailed description of the photo-z errors, we can evaluate a single convenient number, the density 
weighted true redshift of the i-th photo-z bin. It is defined as 

r°° zrii{z)dz f.znAz)dz 

<*> = Jo N {> = Jj r . (aw) 

It is not trivial to robustly evaluate (zi). The special type of photo-z error, in which the photo-z is a monotonic increasing 
function of the true-z, discussed in §j2j is one of the problems. However, even if such error does not exist, there is still another 
difficulty. We can measure robustly. However, it only tells us J.m(z) = pj^iNi. Without proper parameterization, we 
can not derive rn and thus can not robustly evaluate (zi). In reality, for those stage IV lensing surveys, the function m(z) 
should be sufficiently smooth. This implies that we can interpolate pj->i to model rii(z). The simulation data that we used 
does not have sufficient amount of galaxies allowing us to do this exercise. Instead, we will try two approximations to evaluate 

(*>■ 

One possibility is the following approximation 

3 3 

where {zf ) is the density weighted average photo-z of the i-th photo-z bin. The other possibility is to replace {zf) by the 
middle-point of this redshift bin. Through our simulation data, we can check the accuracy of the above two approximations. 
We find that the first approximation is often better. The statistical error in (zi), due to statistical errors in Pi^j is then 



^ ({zf) - <*f >)«**> - (*f )) <*Pi-ri*Pi-i) = Z)«^> ~ <*f - (*f • (A21) 

jk jk 



By far we have assumed that Pi_>j can be any real value. But of course, ^ Pi->j ^ 1- This condition can be straightforwardly 
enforced by p — > (tanh(x) + l)/2, where x can be any real value. Furthermore, there are conditions that C?f' R > 0, which 
can be automatically satisfied by the transformation C 33 ' R — > exp(y), where y can be any real value. Given the small errors 
of stage IV lensing projects, these extra conditions are not likely able to improve the reconstruction accuracy significantly. 
However, for stage 3 projects like CFHTLS, these conditions should be explicitly enforced to improve the reconstruction 
accuracy. 



APPENDIX B: THE INDUCED BIAS 

The self-calibration is based on the validity of Eq.[3]and[S] Systematic deviations from these equation then induce systematic 
errors in Pi->j. Here we outline the calculation of such bias. The bias in reconstructed A is SX = \ F — \ T . \ F is determined 
by requiring dlnL/dX\ X F = 0. Taylor expanding this equation at A T and keeping terms up to 0(8X), we obtain a set of linear 
equations on AA and its solution 

FaSXj = J» ; 5X, = Frfjj , (Bl) 



where 



E^)^ c "ir+E^T 2 ^i- • (B2) 



Here, SC°j = C"j' D —C^' P (X T ) is the model bias in the corresponding power spectrum. For example, for the galaxy distribution 

bias, we have 5C?f = Y^ k Pk^iPk^j(bk^-ibk^.j - l)C£f and SCf 3 = ^ k>m pk->iPm.-n(bm^n - Noticing that, since 

8X oc SC, we have also neglected t erms Q(5C x 8X) when de r iving this equation. All differentials in Eq. IB2l are evaluated at 
A T . This result agrees with that in lHuterer fc Takadal (|2005h : iHuterer et al.l (|2006l ). The related matrices are calculated from 
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the following expressions, 



J C 3 = ^((Tg^-^C^^^ , 

and 

The induced bias in individual p*-»j is shown in Fig. [7] The induced bias in the mean redshift is 
Szi ~ 2^((«f ) - {zf))5pj^i . 

3 

The numerical results are shown in Fig. [8] & [9] 



